Ala Cys Leu Leu Arg Leu Gly Thr Gln Gln Val Gly Pro Leu Gln Leu
-5 1 5

His Thr Gly Ala Ser His Ala Ala Arg Asn His Tyr Glu Val Leu Val
10 15 20

Leu Gly Gly Gly Ser Gly Gly Ile Thr Met Ala Ala Arg Met Lys Arg
25 30 35

Lys Val Gly Ala Glu Asn Val Ala Ile Val Glu Pro Ser Glu Arg His
40 45 50 55

Phe Tyr Gln Pro Ile Trp Thr Leu Val Gly Ala Gly Ala Lys Gln Leu
60 65 70

Ser Ser Ser Gly Arg Pro Thr Ala Ser Val Ile Pro Ser Gly Val Glu
75 80 85

Trp Ile Lys Ala Arg Val Thr Glu Leu Asn Gln Thr Arg Leu His His
90 95 100

Thr Asp Asp Asp Gly
105



(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Umbilical cord 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -86.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.5 

seq ALLTGPTLGSSQA/RW 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 



Met Ser Glu Met Ala Glu Leu Ser Glu Leu Tyr Glu Glu Ser Ser Asp
-85 -80 -75

Leu Gln Met Aso Val Met Pro Gly Glu Gly Asp Leu Pro Gln Met Glu
-70 -65 -60 -55

Val Gly Ser Gly Ser Arg Glu Leu Ser Leu Arg Pro Ser Arg Ser Gly
-50 -45 -40

Ala Gln Gln Leu Glu Glu Glu Gly Pro Met Glu Glu Glu Glu Ala Gln
-35 -30 -25

Pro Met Ala Xaa Gln Arg Gly Asn Gly Ala Leu Leu Thr Gly Pro Thr
-20 -15 -10

Leu Gly Ser Ser Gln Ala Arg Trp Arg Ala Xaa Thr Ser Arg Ala Arg
1 5 10

Thr Arg Ala Pro Gly
15



(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -15.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5.3 

seq IVSVLALIPXTTT/LT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



Met Leu Ile Val Ser Val Leu Ala Leu Ile Pro Xaa Thr Thr Thr Leu
-15 -10 -5 1

Thr Val Gly Gly Gly Val Phe Ala Xaa Val Thr Ala Val Cys Cys Leu
5 10 15

Ala Asp Gly Gly Gly
20
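
The position numbering used throughout this listing can be sketched in a few lines of Python. Signal-peptide residues carry negative positions ending at -1, the first residue of the mature protein is 1, and there is no position 0. The helper names `positions` and `format_rows` are hypothetical, and the layout of 16 residues per row with a label at every multiple of five is an assumption read off the rows printed here, not a rule stated in the document.

```python
def positions(total_length, signal_length):
    """Number residues -signal_length..-1, then 1..mature_length (no position 0)."""
    return [
        i - signal_length if i < signal_length else i - signal_length + 1
        for i in range(total_length)
    ]


def format_rows(residues, signal_length, per_row=16):
    """Lay residues out in rows, each followed by a line of position labels."""
    pos = positions(len(residues), signal_length)
    lines = []
    for start in range(0, len(residues), per_row):
        row = residues[start:start + per_row]
        row_pos = pos[start:start + per_row]
        # Label every multiple of five, plus the first mature residue (position 1).
        labels = [str(p) if (p % 5 == 0 or p == 1) else "" for p in row_pos]
        lines.append(" ".join(f"{r:>4}" for r in row))
        lines.append(" ".join(f"{label:>4}" for label in labels).rstrip())
    return "\n".join(lines)


if __name__ == "__main__":
    # SEQ ID NO: 316 is 37 residues long with a 15-residue signal peptide.
    pos = positions(37, 15)
    print(pos[0], pos[14], pos[15], pos[-1])
```

For a 37-residue entry with a 15-residue signal peptide, the mature protein therefore runs from position 1 to position 22.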



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide
(B) LOCATION: -29.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 5 

seq ELSLLPSSLWVLA/TS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 



Met Thr Cys Arg Gly Ser Cys Ser Tyr Ala Thr Arg Arg Ser Pro Ser

-25 -20 -15 

Glu Leu Ser Leu Leu Pro Ser Ser Leu Trp Val Leu Ala Thr Ser Ser 

-10 -5 1 



Pro Thr Ile Thr Ile Ala Leu Ala Met Ala Ala Gly Asn Leu Cys Pro
5 10 15 



Leu Arg 
20 
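
Each entry's LENGTH and signal-peptide LOCATION fields can be cross-checked against the printed position labels. The sketch below is illustrative only: `parse_location` and `split_lengths` are hypothetical helper names, and the tolerant pattern (which accepts stray spaces or a comma inside the `..` separator) is an assumption made to cope with scanning noise in this text.

```python
import re

# Accepts "-29.. -1" as well as scan-damaged variants such as "-29. . -1" or "-86,. -1".
LOCATION_RE = re.compile(r"(-?\d+)\s*[.,]\s*[.,]\s*(-?\d+)")


def parse_location(text):
    """Return (start, end) from a LOCATION field such as '(B) LOCATION: -29.. -1'."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"no location range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def split_lengths(total_length, location):
    """Split a LENGTH value into (signal peptide length, mature protein length)."""
    start, end = parse_location(location)
    if end != -1 or start > end:
        raise ValueError("expected a signal peptide range ending at -1")
    signal_length = -start  # positions start..-1, with no position 0
    return signal_length, total_length - signal_length


if __name__ == "__main__":
    # SEQ ID NO: 317: LENGTH 50, LOCATION -29..-1
    print(split_lengths(50, "(B) LOCATION: -29. . -1"))
```

For SEQ ID NO: 317 this gives a 29-residue signal peptide and a 21-residue mature protein, which agrees with the last printed position label (20) falling on the second-to-last residue.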



(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Umbilical cord 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(B) LOCATION: -15.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.9 

seq AVVFVFSLLDCCA/LI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



Met Glu Ala Val Val Phe Val Phe Ser Leu Leu Asp Cys Cys Ala Leu
-15 -10 -5 1

Ile Phe Leu Ser Val Tyr Phe Ile Ile Thr Leu Ser Xaa Leu Glu Cys
5 10 15

Asp Tyr Ile Asn Ala Arg Ser Cys Cys Ser Lys Leu Asn Lys Trp Val
20 25 30

Ile Pro Glu Leu Ile Gly His Thr Ile Gly
35 40



(2) INFORMATION FOR SEQ ID NO: 319: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 57 amino acids
(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide

(B) LOCATION: -30.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.9 

seq VAHALSLPAQSYG/ND 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met Ala Ala Thr Ser Gly Thr Asp Glu Pro Val Ser Gly Glu Leu Val 
-30 -25 -20 -15 

Ser Val Ala His Ala Leu Ser Leu Pro Ala Gln Ser Tyr Gly Asn Asp
-10 -5 1

Pro Asp Ile Glu Met Ala Trp Ala Met Arg Ala Met Gln His Ala Glu
5 10 15

Val Tyr Tyr Lys Leu Ile Ser Ser Val
20 25 
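
The one-letter `seq` string given under OTHER INFORMATION can be checked against the three-letter listing. The following is a minimal sketch, assuming the `seq` string shows the 13 residues before the predicted cleavage site and the 2 residues after it, as it appears to in every entry here. The names `to_one_letter` and `cleavage_window` are hypothetical, and the residues are transcribed from SEQ ID NO: 319 with the scan errors "Gin" and "lie" read as Gln and Ile.

```python
# Standard three-letter to one-letter amino acid codes; Xaa marks an unknown residue.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(residues):
    """Convert a list of three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[r] for r in residues)


def cleavage_window(residues, signal_length, before=13, after=2):
    """Return the residues around the cleavage site, formatted like the 'seq' field."""
    one = to_one_letter(residues)
    return (one[signal_length - before:signal_length]
            + "/" + one[signal_length:signal_length + after])


# SEQ ID NO: 319 (LENGTH 57, signal peptide at -30..-1).
SEQ_319 = """
Met Ala Ala Thr Ser Gly Thr Asp Glu Pro Val Ser Gly Glu Leu Val
Ser Val Ala His Ala Leu Ser Leu Pro Ala Gln Ser Tyr Gly Asn Asp
Pro Asp Ile Glu Met Ala Trp Ala Met Arg Ala Met Gln His Ala Glu
Val Tyr Tyr Lys Leu Ile Ser Ser Val
""".split()

if __name__ == "__main__":
    print(len(SEQ_319))                  # 57
    print(cleavage_window(SEQ_319, 30))  # VAHALSLPAQSYG/ND
```

The same check is a convenient way to catch scan errors: a misread residue raises a `KeyError`, and a misplaced row changes the window so that it no longer matches the `seq` field.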



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids
(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(B) LOCATION: -17.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.6

seq ALFLLLHNEMVSG/VY 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

Met Ala Asp Glu Ala Leu Phe Leu Leu Leu His Asn Glu Met Val Ser 
-15 -10 -5 
Gly Val Tyr Lys Ser Ala Xaa Xaa Gly Arg Trp Lys Thr Asp Asp Val
1 5 10 15

Leu Leu Ser Trp Lys Thr Trp Gly Phe Glu Trp Asp 
20 25 



(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Lymphocytes 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -119.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.3 

seq SVCLSIISMLSSC/KE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 



Met Ala Ser Met Gln Lys Arg Leu Gln Lys Glu Leu Leu Ala Leu Gln
-115 -110 -105

Asn Asp Pro Pro Pro Gly Met Thr Leu Asn Glu Lys Ser Val Gln Asn
-100 -95 -90

Ser Ile Thr Gln Trp Ile Val Asp Met Glu Gly Ala Pro Gly Thr Leu
-85 -80 -75

Tyr Glu Gly Glu Lys Phe Gln Leu Leu Phe Lys Phe Ser Ser Arg Tyr
-70 -65 -60

Pro Phe Asp Ser Pro Gln Val Met Phe Thr Gly Glu Asn Ile Pro Val
-55 -50 -45 -40

His Pro His Val Tyr Ser Asn Gly His Ile Cys Leu Ser Ile Leu Thr
-35 -30 -25

Glu Asp Trp Ser Pro Ala Leu Ser Val Gln Ser Val Cys Leu Ser Ile
-20 -15 -10

Ile Ser Met Leu Ser Ser Cys Lys Glu Lys Arg Arg Pro Pro
-5 1 5



(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids

(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -27.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq VLMFCVTPPELET/KX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 



Met Lys Xaa Met Thr Gly Ser Glu Asn Trp Lys Thr Lys Lys Val Leu
-25 -20 -15

Met Phe Cys Val Thr Pro Pro Glu Leu Glu Thr Lys Xaa Asn Ile Thr
-10 -5 1 5

Lys Gly Gly Leu Val Leu Phe Xaa Ala Asn Ser Asn Ser Ser Cys Met
10 15 20

Glu Leu Ser Lys Lys Ile Ala Glu Arg Pro Ala
25 30



(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Lymphocytes 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -33.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 4.1 

seq LFMTRTLCSPGPS/QP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 



Met Gln His Ile Val Gly Val Pro His Val Leu Val Arg Arg Gly Leu
-30 -25 -20

Leu Gly Arg Asp Leu Phe Met Thr Arg Thr Leu Cys Ser Pro Gly Pro
-15 -10 -5

Ser Gln Pro Arg Glu Ala Gly
1 5



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide

(B) LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.8 

seq ALALASSQSHLLG/RD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 



Met Tyr His Gln Ser Glu Ala Leu Ala Leu Ala Ser Ser Gln Ser His
-15 -10 -5

Leu Leu Gly Arg Asp Ser Pro Ser Ala Val Phe Glu Gln Asp Leu Glu
1 5 10

Asn Lys Glu Met Ser Lys Glu Trp Phe Leu Phe Asn Asp Ser Arg Val
15 20 25

Thr Phe Thr Ser Phe Gln Ser Val Gln Lys Ile Thr Ser Arg Phe Pro
30 35 40 45

Lys Asp Thr Trp



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Umbilical cord 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(B) LOCATION: -22.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix

(D) OTHER INFORMATION: score 3.8 

seq FASVAMICAIASG/SE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Met Ser Gly Gln Gly Leu Ala Gly Phe Phe Ala Ser Val Ala Met Ile
-20 -15 -10

Cys Ala Ile Ala Ser Gly Ser Glu Leu Ser Glu Ser Ala Xaa Gly Tyr
-5 1 5 10

Phe Ile Thr Ala Cys Ala Val Ile Ile Leu Thr Ile Ile Cys Tyr Leu
15 20 25

Gly Leu Pro Arg Gln Gly
30 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -30.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.7 

seq SMMLLTVYGGYLC/SV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:



Met Pro Thr Gly Lys Gln Leu Ala Asp Ile Gly Tyr Lys Thr Phe Ser
-30 -25 -20 -15

Thr Ser Met Met Leu Leu Thr Val Tyr Gly Gly Tyr Leu Cys Ser Val
-10 -5 1

Arg Val Tyr His Tyr Phe Gln Trp Arg Arg Ala Gln Arg Xaa Ala Xaa
5 10 15

Glu Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 327: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 amino acids

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -14.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq FPVCLTVTAAVCG/XX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 



Met Phe Pro Val Cys Leu Thr Val Thr Ala Ala Val Cys Gly Xaa Xaa 
-10 -5 1 



Ala Gln



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -15.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq VIFFACVVRVRDG/LP

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 



Met Ser Val Ile Phe Phe Ala Cys Val Val Arg Val Arg Asp Gly Leu
-15 -10 -5 1

Pro Leu Ser Ala Ser Thr Asp Phe Tyr His Thr Gln Asp Phe Leu Glu
5 10 15

Trp Arg Arg Arg Leu Lys Ser Leu Ala Leu Arg Leu Ala Gln Tyr Pro
20 25 30

Gly



(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
(F) TISSUE TYPE: Lymphocytes 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: -68.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 3.5 

seq LVLDVVMLLLYLG/IE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 



Met Leu Xaa Gly Gly Leu Lys Met Ala Pro Arg Gly Lys Arg Leu Ser 
-65 -60 -55 

Ser Thr Pro Leu Glu Ile Leu Phe Phe Leu Asn Gly Trp Tyr Asn Ala
-50 -45 -40

Thr Tyr Phe Leu Leu Glu Leu Phe Ile Phe Leu Tyr Lys Gly Val Leu
-35 -30 -25

Leu Pro Tyr Pro Thr Ala Asn Leu Val Leu Asp Val Val Met Leu Leu
-20 -15 -10 -5

Leu Tyr Leu Gly Ile Glu Val Ile Arg Leu Phe Phe Gly Thr Lys Gly
1 5 10

Asn Leu Cys Gln Arg Lys Met Pro Arg
15 20



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 
(B) LOCATION: -36.. -1

(C) IDENTIFICATION METHOD: Von Heijne matrix

(D) OTHER INFORMATION: score 3.5 

seq PALTILHLPGTEG/VA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 



Met Ile Gly Gly Gly Arg Trp Asp Pro Pro Gly Ala Gln Ala Pro Ser
-35 -30 -25

Ser Gln Ala Phe Pro Arg Arg Pro Ala Leu Thr Ile Leu His Leu Pro
-20 -15 -10 -5

Gly Thr Glu Gly Val Ala Ser Gln Leu Thr Pro Ala Pro Lys Leu Ser
1 5 10

Ser Ala Ala Gly Trp Leu Glu Val Pro Phe Asp Ala Ile Pro Ala Pro
15 20 25 

Gly 



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide

(B) LOCATION: -28.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 8.4 

seq LLRLLCLLPTGLP/VR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 



Met Val Arg Arg Val Gln Pro Asp Arg Lys Gln Leu Pro Leu Val Leu
-25 -20 -15

Leu Arg Leu Leu Cys Leu Leu Pro Thr Gly Leu Pro Val Arg Ser Val
-10 -5 1

Asp Phe Asn Arg Gly Thr Asp Asn Ile Thr Val Arg Gln Gly Asp Thr
5 10 15 20

Ala Ile Leu Arg Phe Leu Xaa Ser Gly
25



(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Lymph ganglia 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide

(B) LOCATION: -19.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 7.4 

seq LVFIIGLVGNLLA/LV 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



Met Pro Leu His Tyr Ser Leu Val Phe Ile Ile Gly Leu Val Gly Asn
-15 -10 -5

Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile Asn Ser
1 5 10

Thr Thr Leu Tyr Ser Thr Asn Leu Val Ile Ser Asp Ile Leu Phe Xaa
15 20 25 

Thr Val Gly 
30 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(F) TISSUE TYPE: Umbilical cord 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide

(B) LOCATION: -30.. -1 

(C) IDENTIFICATION METHOD: Von Heijne matrix 

(D) OTHER INFORMATION: score 6.5 

seq WATLGLLVAGLGG/HD 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

Met Ala Arg Gly Leu Gly Ala Pro His Trp Val Ala Val Gly Leu Leu
-30 -25 -20 -15

Thr Trp Ala Thr Leu Gly Leu Leu Val Ala Gly Leu Gly Gly His Asp
-10 -5 1

Asp Leu His Asp Asp Leu Gln Glu Asp Phe His Gly His Ser His Arg
5 10 15

His Ser His Glu Asp Phe His His Gly Xaa Ser His Ala His Gly His
20 25 30

Gly His Xaa His Glu Ser Met
35 40

1. Claims: Invention 1: claims 1-37 all partially

Nucleic acid comprising the sequence as in Seq.ID:38, 
complementary sequence, fragments, hybridizing sequences. 
Polypeptide comprising a signal peptide encoded by said 
nucleotide sequence. Vector encoding a fusion protein 
comprising said signal peptide. A method of directing the 
extracellular secretion of a polypeptide by means of said 
vector. Method of importing a polypeptide into a cell by
means of said signal peptide. A method for making a cDNA 
encoding a secretory protein, partially encoded by said 
nucleotide sequence, corresponding cDNA. Polypeptide encoded 
by said nucleotide sequence, comprising a sequence as in 
Seq.ID:186, method of making said polypeptide. Method of 
obtaining a promoter located upstream of said nucleotide 
sequence, promoter thereof. 



2. Claims: Inventions 2-147: claims 1-37 all partially 

Inventions 2-147: Idem as subject 1 but limited to each of
the DNA sequences as in Seq.ID:39-185, and corresponding
polypeptides (where invention 2 is limited to Seq.ID:39 and
187, invention 3 is limited to Seq.ID:40 and 188, ...,
invention 147 is limited to Seq.ID:185 and 333).

For the sake of conciseness, the first subject matter is 
explicitly defined, the other subject matters are defined by 
analogy thereto. 



INTERNATIONAL SEARCH REPORT 

Information on patent family members



International Application No

PCT/IB 98/01237



Patent document           Publication     Patent family       Publication
cited in search report    date            member(s)           date

EP 0279582 A              24-08-1988      AU 618524 B         02-01-1992
                                          AU 1178488 A        18-08-1988
                                          AU 661696 B         03-08-1995
                                          AU 1941092 A        24-09-1992
                                          CA 1321764 A        31-08-1993
                                          EP 0832981 A        01-04-1998
                                          JP 63309192 A       16-12-1988
                                          US 5565362 A        15-10-1996
                                          US 5304489 A        19-04-1994

WO 9634981 A              07-11-1996      FR 2733765 A        08-11-1996
                                          FR 2733762 A        08-11-1996
                                          AU 5982996 A        21-11-1996
                                          CA 2220045 A        07-11-1996
                                          EP 0824598 A        25-02-1996

EP 0625572 A              23-11-1994      JP 6153953 A        03-06-1994
                                          WO 9408001 A        14-04-1994
                                          US 5597713 A        28-01-1997

WO 9707198 A              27-02-1997      US 5707829 A        13-01-1998
                                          AU 6712396 A        18-02-1997
                                          AU 6768596 A        12-03-1997
                                          CA 2227220 A        06-02-1997
                                          CA 2229208 A        27-02-1997
                                          EP 0839196 A        06-05-1998
                                          EP 0851875 A        08-07-1998
                                          WO 9704097 A        06-02-1997



Form PCT/ISA/210 (patent family annex) (July 1992)






WORLD INTELLECTUAL PROPERTY ORGANIZATION 

International Bureau 




PCT

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification:

C12N 15/12, C07K 14/47, C12N 15/10, 
15/11 



A3 



(11) International Publication Number: WO 99/06553 

(43) International Publication Date: 11 February 1999 (11.02.99)



(21) International Application Number: PCT/IB98/01237

(22) International Filing Date: 31 July 1998 (31.07.98)



(30) Priority Data:
08/905,051 1 August 1997 (01.08.97) US



(71) Applicant (for all designated States except US): GENSET
[FR/FR]; 24, rue Royale, F-75008 Paris (FR).

(72) Inventors; and

(75) Inventors/Applicants (for US only): DUMAS MILNE EDWARDS,
Jean-Baptiste [FR/FR]; 8, rue Grégoire de Tours,
F-75006 Paris (FR). DUCLERT, Aymeric [FR/FR]; 6 ter,
rue Victorine, F-94100 Saint-Maur (FR). LACROIX, Bruno
[FR/FR]; 9, route de Vourles, F-69230 Saint-Genis Laval
(FR).

(74) Agents: MARTIN, Jean-Jacques et al.; Cabinet Regimbeau,
26, avenue Kléber, F-75116 Paris (FR).



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE,
GH, GM, HR, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ,
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW,
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL,
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR,
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF,
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG).



Published 

With international search report. 
With amended claims. 

(88) Date of publication of the international search report: 

8 April 1999 (08.04.99) 

Date of publication of the amended claims: 

27 May 1999 (27.05.99) 



(54) Title: 5' ESTs FOR SECRETED PROTEINS EXPRESSED IN VARIOUS TISSUES 



(57) Abstract 

The sequences of 5' ESTs derived from mRNAs encoding secreted proteins are disclosed. The 5' ESTs may be used to obtain cDNAs
and genomic DNAs corresponding to the 5' ESTs. The 5' ESTs may also be used in diagnostic, forensic, gene therapy, and chromosome
mapping procedures. Upstream regulatory sequences may also be obtained using the 5' ESTs. The 5' ESTs may also be used to design
expression vectors and secretion vectors.






FOR THE PURPOSES OF INFORMATION ONLY
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT.



AL Albania
AM Armenia
AT Austria
AU Australia
AZ Azerbaijan
BA Bosnia and Herzegovina
BB Barbados
BE Belgium
BF Burkina Faso
BG Bulgaria
BJ Benin
BR Brazil
BY Belarus
CA Canada
CF Central African Republic
CG Congo
CH Switzerland
CI Côte d'Ivoire
CM Cameroon
CN China
CU Cuba
CZ Czech Republic
DE Germany
DK Denmark
EE Estonia
ES Spain
FI Finland
FR France
GA Gabon
GB United Kingdom
GE Georgia
GH Ghana
GN Guinea
GR Greece
HU Hungary
IE Ireland
IL Israel
IS Iceland
IT Italy
JP Japan
KE Kenya
KG Kyrgyzstan
KP Democratic People's Republic of Korea
KR Republic of Korea
KZ Kazakstan
LC Saint Lucia
LI Liechtenstein
LK Sri Lanka
LR Liberia
LS Lesotho
LT Lithuania
LU Luxembourg
LV Latvia
MC Monaco
MD Republic of Moldova
MG Madagascar
MK The former Yugoslav Republic of Macedonia
ML Mali
MN Mongolia
MR Mauritania
MW Malawi
MX Mexico
NE Niger
NL Netherlands
NO Norway
NZ New Zealand
PL Poland
PT Portugal
RO Romania
RU Russian Federation
SD Sudan
SE Sweden
SG Singapore
SI Slovenia
SK Slovakia
SN Senegal
SZ Swaziland
TD Chad
TG Togo
TJ Tajikistan
TM Turkmenistan
TR Turkey
TT Trinidad and Tobago
UA Ukraine
UG Uganda
US United States of America
UZ Uzbekistan
VN Viet Nam
YU Yugoslavia
ZW Zimbabwe






WO 99/06553



PCT/IB98/01237 



AMENDED CLAIMS
[received by the International Bureau on 8 April 1999 (08.04.99);
original claims 1-37 amended (5 pages)]

1. A purified or isolated nucleic acid comprising the sequence of one of SEQ ID
NOs: 38-185 or comprising a sequence complementary thereto.

2. The nucleic acid of Claim 1, wherein said nucleic acid is recombinant.

3. A purified or isolated nucleic acid comprising at least 10 consecutive bases of
the sequence of one of SEQ ID NOs: 38-185 or one of the sequences complementary
thereto, with the exception of a purified or isolated nucleic acid consisting of consecutive
bases which are situated entirely in the sequences identified as Feature in the
corresponding SEQ ID under key:other.

4. A purified or isolated nucleic acid comprising at least 15 consecutive bases of
the sequence of one of SEQ ID NOs: 38-185 or one of the sequences complementary
thereto, with the exception of a purified or isolated nucleic acid consisting of consecutive
bases which are situated entirely in the sequences identified as Feature in the
corresponding SEQ ID under key:other.

5. The nucleic acid of Claim 4, wherein said nucleic acid is recombinant.

6. A purified or isolated nucleic acid [...] capable of hybridizing
under stringent conditions to the sequence of one of SEQ ID NOs: 38-185 or one of the
sequences complementary to the sequences of SEQ ID NOs: 38-185, with the exception of
a purified or isolated nucleic acid consisting of consecutive bases which are situated
entirely in the sequences identified as Feature in the corresponding SEQ ID under
key:other.

7. The nucleic acid of Claim 6, wherein said nucleic acid is recombinant.

8. A purified or isolated nucleic acid encoding a human gene product, said
[...] SEQ ID NO: 38-185, with the exception of a purified or isolated nucleic acid consisting of
consecutive bases which are situated entirely in the sequences identified as Feature in the
corresponding SEQ ID under key:other.



9. A purified or isolated nucleic acid having the sequence of one of SEQ ID
NOs: 38-185 or having a sequence complementary thereto.

10. A purified or isolated nucleic acid comprising the nucleotides of one of SEQ
ID NOs: 38-185 which encode a signal peptide.

11. A purified or isolated polypeptide comprising a signal peptide encoded by
one of the sequences of SEQ ID NOs: 38-185.

12. A vector encoding a fusion protein comprising a polypeptide and a signal
peptide, said vector comprising a first nucleic acid encoding a signal peptide encoded by one
of the sequences of SEQ ID NOs: 38-185 operably linked to a second nucleic acid encoding a
polypeptide.

13. A method of directing the extracellular secretion of a polypeptide or the
insertion of a polypeptide into the membrane comprising the steps of:

AMENDED SHEET (ARTICLE 19)



obtaining a vector according to Claim 12; and

introducing said vector into a host cell such that said fusion protein is secreted into the
extracellular environment of said host cell or inserted into the membrane of said host cell.

14. A method of importing a polypeptide into a cell comprising contacting said
cell with a fusion protein comprising a signal peptide encoded by one of the sequences of
SEQ ID NOs: 38-185 operably linked to said polypeptide.

15. A method of making a cDNA encoding a human secretory protein that is
partially encoded by one of SEQ ID NOs 38-185, comprising the steps of

obtaining a cDNA comprising one of the sequences of SEQ ID NOs: 38-185;

contacting said cDNA with a detectable probe comprising at least 15 consecutive
nucleotides of said sequence of SEQ ID NO: 38-185 or a sequence complementary thereto
under conditions which permit said probe to hybridize to said cDNA;

identifying a cDNA which hybridizes to said detectable probe; and

isolating said cDNA which hybridizes to said probe.

16. An isolated or purified cDNA encoding a human secretory protein, said
human secretory protein comprising the protein encoded by one of SEQ ID NOs 38-185 or a
fragment thereof of at least 10 amino acids, said cDNA being obtainable by the method of
Claim 15.

17. The cDNA of Claim 16 wherein said cDNA comprises the full protein coding
sequence partially included in one of the sequences of SEQ ID NOs: 38-185.

18. A method of making a cDNA comprising one of the sequences of SEQ ID
NOs: 38-185, comprising the steps of

contacting a collection of mRNA molecules from human cells with a first primer
capable of hybridizing to the polyA tail of said mRNA;

hybridizing said first primer to said polyA tail;

reverse transcribing said mRNA to make a first cDNA strand;

making a second cDNA strand complementary to said first cDNA strand using at
least one primer comprising at least 15 nucleotides of one of the sequences of SEQ ID NOs
38-185; and

isolating the resulting cDNA comprising said first cDNA strand and said second
cDNA strand.

19. An isolated or purified cDNA encoding a human secretory protein, said
human secretory protein comprising the protein encoded by one of SEQ ID NOs 38-185 or a
fragment thereof of at least 10 amino acids, said cDNA being obtainable by the method of
Claim 18.

20. The cDNA of Claim 19 wherein said cDNA comprises the full protein coding
sequence partially included in one of the sequences of SEQ ID NOs: 38-185.

21. The method of Claim 18, wherein the second cDNA strand is made by:

contacting said first cDNA strand with a first pair of primers, said first pair of primers
comprising a second primer comprising at least 15 consecutive nucleotides of one of the
sequences of SEQ ID NOs 38-185 and a third primer having a sequence therein which is
included within the sequence of said first primer;

performing a first polymerase chain reaction with said first pair of nested primers to
generate a first PCR product;

contacting said first PCR product with a second pair of primers, said second pair of
primers comprising a fourth primer, said fourth primer comprising at least 15 consecutive
nucleotides of said sequence of one of SEQ ID NOs 38-185, and a fifth primer, said fourth
and fifth primers being capable of hybridizing to sequences within said first PCR product; and

performing a second polymerase chain reaction, thereby generating a second PCR
product.

22. An isolated or purified cDNA encoding a human secretory protein, said
human secretory protein comprising the protein encoded by one of SEQ ID NOs 38-185, or a
fragment thereof of at least 10 amino acids, said cDNA being obtainable by the method of
Claim 21.

23. The cDNA of Claim 22 wherein said cDNA comprises the full protein coding
sequence partially included in one of the sequences of SEQ ID NOs: 38-185.

24. The method of Claim 18 wherein the second cDNA strand is made by:

contacting said first cDNA strand with a second primer comprising at least 15
consecutive nucleotides of the sequences of SEQ ID NOs: 38-185;

hybridizing said second primer to said first strand cDNA; and

extending said hybridized second primer to generate said second cDNA strand.

25. An isolated or purified cDNA encoding a human secretory protein, said
human secretory protein comprising the protein partially encoded by one of SEQ ID NOs
38-185 or comprising a fragment thereof of at least 10 amino acids, said cDNA being obtainable
by the method of Claim 24.

26. The cDNA of Claim 25, wherein said cDNA comprises the full protein coding
sequence partially included in one of the sequences of SEQ ID NOs: 38-185.

27. A method of making a protein comprising one of the sequences of SEQ ID
NO: 186-333, comprising the steps of

obtaining a cDNA encoding the full protein sequence partially included in one of the
sequences of SEQ ID NO: 38-185;

inserting said cDNA in an expression vector such that said cDNA is operably linked
to a promoter;

introducing said expression vector into a host cell whereby said host cell produces the
protein encoded by said cDNA; and

isolating said protein.

28. An isolated protein obtainable by the method of Claim 27. 

29. A method of obtaining a promoter DNA comprising the steps of

obtaining DNAs located upstream of the nucleic acids of SEQ ID NO: 38-185 or the
sequences complementary thereto;

screening said upstream DNAs to identify a promoter capable of directing
transcription initiation; and

isolating said DNA comprising said identified promoter.

30. The method of Claim 29, wherein said obtaining step comprises chromosome
walking from said nucleic acids of SEQ ID NO: 38-185 or sequences complementary thereto.

31. The method of Claim 30, wherein said screening step comprises inserting said
upstream sequences into a promoter reporter vector.

32. The method of Claim 30, wherein said screening step comprises identifying
motifs in said upstream DNAs which are transcription factor binding sites or transcription
start sites.

33. An isolated promoter obtainable by the method of Claim 32.

34. An isolated or purified protein comprising one of the sequences of SEQ ID
NO: 186-333.

35. In an array of discrete ESTs or fragments thereof of at least 15 nucleotides in
length, the improvement comprising inclusion in said array of at least one of the sequences of
SEQ ID NOs: 38-185, or one of the sequences complementary to the sequences of SEQ ID
NOs: 38-185, or a fragment thereof of at least 15 consecutive nucleotides.

36. The array of Claim 35 including therein at least two of the sequences of SEQ
ID NOs: 38-185, the sequences complementary to the sequences of SEQ ID NOs: 38-185, or
fragments thereof of at least 15 consecutive nucleotides.

37. The array of Claim 35 including therein at least five of the sequences of SEQ
ID NOs: 38-185, the sequences complementary to the sequences of SEQ ID NOs: 38-185 or
fragments thereof of at least 15 consecutive nucleotides.
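
The recurring phrase "at least 15 consecutive nucleotides" of a sequence "or one of the sequences complementary" to it can be illustrated with a short Python sketch. This is an informal illustration of the string comparison only, not a statement of how the claims are construed: it ignores the "Feature ... key:other" exclusion recited in claims 3 to 8, the function names are hypothetical, and the `reference` string is a made-up toy sequence, because the nucleotide sequences of SEQ ID NOs: 38-185 are not reproduced in this part of the document.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def has_consecutive_bases(query, reference, k=15):
    """True if query shares a run of k consecutive bases with reference or its complement."""
    query = query.upper()
    reference = reference.upper()
    for strand in (reference, reverse_complement(reference)):
        # Collect every window of k bases on this strand.
        windows = {strand[i:i + k] for i in range(len(strand) - k + 1)}
        if any(query[i:i + k] in windows for i in range(len(query) - k + 1)):
            return True
    return False


if __name__ == "__main__":
    # Toy reference sequence, invented for this example only.
    reference = "ATGGCGTACGTTAGCCTAGGATCCGATTACA"
    probe = reference[5:20]  # exactly 15 consecutive bases
    print(has_consecutive_bases(probe, reference))
    print(has_consecutive_bases(reverse_complement(probe), reference))
    print(has_consecutive_bases(reference[5:19], reference))  # only 14 bases
```

Building the set of windows once per strand keeps the check linear in the combined length of the two sequences, which matters when one query is compared against many references.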